Death and resurrection of the zeroth principle of thermodynamics 
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The zeroth principle of thermodynamics in the form "temperature is uniform at equilibrium" is 
notoriously violated in relativistic gravity. Temperature uniformity is often derived from the maxi- 
mization of the total number of microstates of two interacting systems under energy exchanges. Here 
we discuss a generalized version of this derivation, based on informational notions, which remains 
valid in the general context. The result is based on the observation that the time taken by any 
system to move to a distinguishable (nearly orthogonal) quantum state is a universal quantity that 
depends solely on the temperature. At equilibrium the net information flow between two systems 
must vanish, and this happens when two systems transit the same number of distinguishable states 
in the course of their interaction. 
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I. NON-UNIFORM EQUILIBRIUM 
TEMPERATURE 

According to non-relativistic thermodynamics, a ther- 
mometer (say, a line of mercury in a glass tube) , moved 
up and down a column of gas at equilibrium in a constant 
gravitational field, measures a uniform temperature. But 
this prediction is wrong. Relativistic effects make the gas 
warmer at the bottom and cooler at the top, by a cor- 
rection proportional to c^^, where c is the speed of light. 
This is the well known Tolman-Ehrenfest effect, discov- 
ered in the thirties [1, 2] and later derived in a variety 
of different manners [3-11]. The temperatures Ti and T2 
measured by the same thermometer at two altitudes hi 
and /i2 in a Newtonian potential are related by the 
Tolman law 
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The general-covariant version of this law reads 
r|^| — constant, 



(1) 



(2) 



where |^| is the norm of the timelike Killing field with 
respect to which equilibrium is established. 

A violation of the uniformity of temperature seems 
counterintuitive at first, especially if one has in mind a 
definition of "temperature" as a label of the equivalence 
classes of all systems in equilibrium with one another. 
In a relativistic context a physical thermometer does not 
measure this label and we must therefore distinguish two 
notions: (i) a quantity To defined as this label (propor- 
tional to the constant in (2)), and (ii) the temperature 
T measured by a standard thermometer. 

In the micro-canonical framework the entropy S{E) is 
the logarithm of the number of microstates N{E) that 
have energy E and T can be identified with the inverse 
of the derivative of S{E), 



dSiE) _ 1 
dE ^ kf' 
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by maximizing the total number of states N — N1N2 
under an energy transfer dE between the two. This gives 
easily Ti = T2 . In the presence of relativistic gravity, this 
derivation fails because conservation of energy becomes 
tricky: intuitively speaking, the energy dE reaching the 
upper system is smaller than the one leaving the lower 
system because "energy weighs" . 

Is there a more general statistical argument that gov- 
erns equilibrium in a relativistic context? Can the Tol- 
man law be derived from a principle generalizing the 
maximization of the number of microstates, without re- 
course to specific models of energy transfer, as is com- 
monly done in the derivations of the Tolman-Ehrenfest 
effect? 

In this paper we show that the answer to these ques- 
tions is positive, and we provide a generalization of the 
statistical derivation of the uniformity of temperature, 
which remains valid in a relativistic context. 

The core idea is to focus on histories rather than states. 
This is in line with the general idea that states at fixed 
time are not a convenient handle on general relativistic 
mechanics, where the notion of process, or history, turns 
out to be more useful [12]. Equilibrium in a stationary 
spacetime, namely the Tolman law, is our short-term ob- 
jective, but our long-term aim is understanding equilib- 
rium in a fully generally covariant context, where thermal 
energy can flow also to gravity [13-15], therefore we look 
for a general principle that retains its meaning also in 
the absence of a background spacetime. 

We show in this paper that one can assign an informa- 
tion content to a history, and two systems are in equi- 
librium when their interacting histories have the same 
information content. In this case the net information 
flow vanishes, and this is a necessary condition for equi- 
librium. This generalized principle reduces to standard 
thermodynamics in the non-relativistic setting, but yields 
the correct relativistic generalization. 

This result is based on a key observation: at tempera- 
ture T, a system transits 
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where k is the Boltzmann constant. The fact that two 
systems in equilibrium have the same T can be derived 



states in a (proper) time t, in a sense that is made precise 
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FIG. 1: Typical overlap between ^p{0) and ?/>(t) as a function 
of time. 



below. The quantity r was introduced in [13, 14] with 
different motivations, and called thermal time. Here we 
find the physical interpretation of this quantity: it is time 
measured in number of elementary "time steps" , where a 
step is the characteristic time taken to move to a distin- 
guishable quantum state. Remarkably, this time step is 
universal at a given temperature. Our main result is that 
two systems are in equilibrium if during their interaction 
they cover the same number of time steps. 



II. THE UNIVERSAL TIME STEP 

Consider a conventional hamiltonian system with 
hamiltonian operator H. Let V'(O) be the state at time 
zero and tpit) its evolution. What is the time scale for 
to become significantly distinct from "0(0)? The sep- 
aration of the state from its initial position is given by 
the overlap between "0(0) and 'il'{t)^ namely 



p{t)^\{m\m)\' 



(5) 



The typical behavior of P{t), for instance in the case of 
a semiclassical wave packet, is as in Figure 1. The state 
becomes rapidly distinguishable (nearly orthogonal) to 
the initial state, in a short time. Let us call to the char- 
acteristic decay time for the system self overlap. What 
is its value? The time to can be estimated by Taylor ex- 
panding P{t) for small times. The first time derivative of 
P{t) clearly vanishes at t = which is a maximum, there- 
fore we get the time scale from the second derivative. A 
straightforward calculation gives 



which implies a characteristic decay time 



to 



AE' 



(6) 
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FIG. 2: A phase space region moves from one cell to next in 
the time to ~ h/AE. 



in accord with the time-energy Heisenberg principle, and 
with the fact that energy eigenstates "do not change".^ 
The same conclusion can be reached also in the classi- 
cal theory. Consider a classical hamiltonian system with 
phase space F and hamiltonian H. For simplicity, say 
that the system has a single degree of freedom, so that 
F is two-dimensional. Let Ei and E2 be two (nearby) 
equal-energy surfaces and 7 a line joining the two sur- 
faces. Consider the motion of 7 under the time flow, see 
Figure 2. How long does it take for 7 to sweep a (small) 
phase-space volume V7 The answer is easy to find: the 
volume of the region R swept by 7 is the integral of the 
symplectic two form uj ^ dp A dq and its time derivative 
is 



dVjt) 
dt 
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where X is the hamiltonian time flow. This is given by 
the Hamilton equations, which can be written in the com- 
pact form 



-dH. 



Inserting this in the previous equation gives 



dV{t) 
dt 



dH ^ E2- Ei = AE. 



(9) 



(10) 



Now consider a small region of phase space, such as the 
blue region in Figure 2. Say that the volume of this 
region is h. How long does it take for this region to be 
carried along by the dynamics to a new position where 
the overlap with its initial location is negligible? It is 
clear that the answer is again (7). 

This is the same result as in the quantum theory: the 
time step to is the time taken generically to move from 
a state to a distinct (orthogonal) state. Indeed, a semi- 
classical state can be viewed as related to a Planck-size 
cell of the classical phase space, and the decay time of 
the quantum overlap P{t) is essentially the time the sys- 
tem moves from one Planck-size cell to the next. The 



^ Notice that this observation provides also a direct meaning to 
the time-energy uncertainty principle. 
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argument can be repeated with a bit more labour for a 
system with many degrees of freedom.^ 

Let us now consider a system in thermal equilibrium 
with a thermal bath at temperature T . Its mean energy 
is going to be kT and the variance of the energy is also 
going to be kT. Thus we have ^ kT. At a given 
temperature T, consider the time step 



Accordingly, in fi. = fc = 1 units temperature is measured 
in "states per second" and is nothing other than the num- 
ber of states transited by the system per unit of (proper) 
time. This is the general informational meaning of tem- 
perature. A warmer system is a system where individual 
states move faster across unit cells of phase space. 



to = 



kT' 



(11) 



According to the previous discussion, this is the average 
time the system takes to move from a state to the next 
(distinguishable) state. This average time step is there- 
fore universal: it depends only on the temperature, and 
not on the properties of the system. 



III. THERMAL TIME, TEMPERATURE AND 
THEIR PHYSICAL MEANING 



The dimensionless quantity 



t 



(12) 



measures time in units of the time step to, that is, it 
estimates the number of distinguishable states the system 
has transited during a given interval. For a system in 
thermal equilibrium, (11) gives 



kT 



(13) 



This same quantity was introduced with different moti- 
vations in [13, 14] under the name thermal time. It is the 
parameter of the Tomita flow on the observable algebra, 
generated by the thermal state. In the classical theory, 
it is the parameter of the hamiltonian flow oih — — In p, 
where p is a Gibbs state, vah — k — \ units. 

The argument in the previous section unveils the phys- 
ical interpretation of thermal time: thermal time, which 
is dimensionless, is simply the number of distinguishable 
states a system has transited during an interval. In a 
sense, it is "time counted in natural elementary steps", 
which exist because the Heisenberg principle implies an 
effective granularity of the phase space. 

Notice also that temperature is the ratio between ther- 
mal time and (proper) time [16] 



T 



hr 
'kt' 



(14) 



^ If we can diagonalize locally the state and the dynamics in 
energy-angle variables {E„,(pn), the phase space volume swept 
by the boundary of a given region, in a time dt, is dV ^ 
AE„{V/Vn)dt, where Vn is the phase space volume of the n- 
th degree of freedom. A coherent state has volume h in each of its 
degrees of freedom, giving dV ~ h"~^ J2n ^E„dt ^ h"~^ AEdt. 
Since a phase space cell has volume h", the time taken to move 
one cell is again ~ h/AE. 



IV. EQUILIBRIUM BETWEEN HISTORIES 

Let us come to the notion of equilibrium. Consider 
two systems. System 1 and System 2, that are in inter- 
action during a certain interval. This interaction can be 
quite general but should allow the exchange of energy be- 
tween the two systems. During the interaction interval 
the first system transits Ni states, and the second N2, in 
the sense illustrated above. Since an interaction channel 
is open, each system has access to the information about 
the states the other has transited through the physical 
exchanges of the interaction. 

The notion of information used here is purely physi- 
cal, with no relation to semantics, meaning, significance, 
consciousness, records, storage, or mental, cognitive, ide- 
alistic or subjectivistic ideas. Information is simply a 
measure of a number of states, as is defined in the classic 
text by Shannon [17]. 

System 2 has access to an amount of information 
Ii = logiVi about System 1, and System 1 has access 
to an amount of information I2 = log N2 about System 
2. Let us define the net flow of information in the course 
of the interaction as 51 = I2 — Ii- Equilibrium is by def- 
inition invariant under time reversal, and therefore any 
flow must vanish. It is therefore interesting to postulate 
that also the information flow 51 vanishes at equilibrium. 
Let us do so, and study the consequences of this assump- 
tion. That is, we consider the possibility of taking the 
vanishing of the information flow 



51 = 



(15) 



as a general condition for equilibrium, generalizing the 
maximization of the number of microstates of the non- 
relativistic formalism.'^ 



^ In the micro-canonical framework equilibrium is characterized by 
maximizing entropy, namely the number of micro-states sharing 
given macroscopic values. This is meaningful, e.g. under the 
ergodic hypothesis, according to which time averages can be re- 
placed by phase-space averages. In other words, if the ergodic 
hypothesis holds, a micro-canonical ensemble is essentially the 
family of states over which the single real individual microstate 
wanders. What we are doing here is essentially undoing this step 
and moving back from phase-space ensembles to actual histories. 
In the classical theory, there is a measure associated to a space- 
time volume and not to the length of a history. But in this paper 
we have shown that there is also a natural measure associated 
to the history of a quantum state. This allows us to backtrack 
from phase-space volume to number of steps along the history. 
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Let us see what this imphes. At equiUbrium 

Ni^N2. (16) 

Since the rate that states are transited is given by r and 
we assume a fixed interaction interval, the equihbrium 
conditions also reads 

Tl=T2. (17) 

Now, consider a non-relativistic context where two sys- 
tem are in equihbrium states at temperatures Ti and 
T2, respectively. In the non-relativistic limit, time is a 
universal quantity, which we call t. Therefore the con- 
dition (17) together with (13) implies that to — h/kT 
has the same value for the two systems and Ti — T2, 
which is the standard non-relativistic condition for equi- 
librium: temperature is uniform at equilibrium. On a 
curved spacetime, on the other hand, (proper) time is a 
local quantity ds that varies from one spatial region to 
another. Therefore thermal time is given by 

kT 

dT = j-ds. (18) 

In order for equilibrium to exist on a given spacetime, 
spacetime itself must be stationary, namely have a time- 
like Killing field ^, and an equilibrium configuration will 
be ^ invariant. Proper time along the orbits of ^ is 
ds = \^\dt where t is an afRne parameter for ^. Therefore 
thermal time is now 

kT 

dr^—m. (19) 

If two systems located in regions with different |^| are in 
thermal contact for a finite interval At, then they are in 
equilibrium if |^|T has the same value. This is precisely 
the Tolman law (2). Therefore the generalized first prin- 
ciple (15) gives equality of temperature in the non rela- 
tivistic case and the Tolman law in the general case. 

In static coordinates, ds^ = goo{x)dt'^ — gij{x)x^x^ and 
thermal time is proportional to coordinate time. The 
KilHng vector field is ^ = d/dt and |^| — ^/gQo. In the 
Newtonian limit 500 — 1 + 2$/c^ and we recover (1). 

Returning to the cylinder of gas in a constant gravita- 
tional field we see that during a coordinate-time interval 
At the proper times lapsed in the upper and lower sys- 
tems are different: identical clocks at different altitudes 
run at different rates. But the lower system is hotter, 
its degrees of freedom move faster in clock time from one 
state to the next. This faster motion compensates exactly 
the slowing down of proper time, so that upper and lower 
systems transit the same number of states during a com- 
mon interaction interval At. While a pendulum slows 
down in a deeper gravitational potential, at equilibrium 
all systems transit from state to state at the same com- 
mon rate, independent from the gravitational potential. 
This result provides a simple and intuitive interpretation 
of the Tolman effect. 



V. WIEN'S DISPLACEMENT LAW 

The Tolman-Ehrenfest effect is a small relativistic cor- 
rection, at the surface of the earth VT/T = 10~^*cm~^, 
and is not yet experimentally established. The principle 
proposed here also provides a mellifluous derivation of 
the well-tested Wien displacement law. 

Consider an isothermal cavity filled with electromag- 
netic radiation. A slow, adiabatic expansion of the cavity 
leaves the radiation in equilibrium throughout the ex- 
pansion process. During this expansion both the normal 
mode frequencies and the temperature of the radiation 
are adjusted. For the mode of frequency v, the condition 
of remaining in equilibrium is that the slow expansion 
take place on a time scale much greater than the period 
of the mode, i.e. t^xp » = ^/v- Hence, the relevant 
clock for this mode is its period. 

The condition that this mode be in equilibrium during 
the entire expansion history is that 

kT 

T = —if = const. (20) 

or expressed in terms of the mode's frequency, 
T 

— = const., (21) 

which is precisely the general form of Wien's displace- 
ment law. This special relativistic application of the 
principal proposed here plays an important role in the 
astrophysical determination of star temperatures. 

VI. CONCLUSIONS 

We have suggested a generalized statistical principle 
for equilibrium in statistical mechanics. We expect that 
this will be of use going towards a genuine foundation for 
general covariant statistical mechanics. 

The principle is formulated in terms of histories rather 
than states and expressed in terms of information. It 
reads: Two histories are in equilibrium if the net infor- 
mation flow between them vanishes, namely if they transit 
the same number of states during the interaction period. 

This is equivalent to saying that the thermal time t 
elapsed for the two systems is the same, since thermal 
time is the number of states transited, or, equivalently, 
is (proper) time in to units, where to is the (proper) time 
needed for a system to transit to an orthogonal state. 
The elementary (proper) time step to is given by h/kT 
and is a universal quantity for all systems at temperature 
T. 

In non-relativistic physics, time is universal and the 
above principle implies that temperature is uniform at 
equilibrium. On a curved spacetime, proper time varies 
locally and what is constant is the product of tempera- 
ture and proper time. 

Temperature admits the informational interpretation 
as states transited per second, consistent with the fact 
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that in h = k = 1 units it has dimension of second~^. 
Temperature is the rate at which systems move from state 
to state. 
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